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O ' Abstract 

w 

^ ■ We present a first morphometric investigation of a preliminary sample from the 

SDSS of 154287 galaxies with apparent magnitude 14.5 < rrij. < 17.5 and redshift 
I 0.001 < z < 0.4. We measure the Minkowski Functionals, which are a complete set 

^ I of morphological descriptors. To account for the complicated wedge-like geometry 

of the present survey data, we construct isodensity contour surfaces from the galaxy 
positions in redshift space and employ two complementary methods of computing the 
Minkowski Functionals. We find that the observed Minkowski Functionals for SDSS 
galaxies are consistent with the prediction of a A-dominated spatially-fiat Cold Dark 
Matter model with random-Gaussian initial conditions, within the cosmic variance 
estimated from the corresponding mock catalogue. We expect that future releases of 
the SDSS survey will allow us to distinguish morphological differences in the galaxy 
distribution with regard to different morphological type and luminosity ranges. 

Key words: cosmology: large-scale structure of universe — cosmology: observa- 
tions — methods: statistical 
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1. Introduction 



The first-year WMAP ( Wilkinson Microwave Anisotropy Probe) data allow measure- 
ment of the parameters of the standard cosmological model to unprecedented accuracy; it now 
seems reasonable to assume fl^a ~ 0.3, ~ 0.7, and h ^ 0.7 (Spergel et al. 2003). Furthermore, 
the data put stringent constraints on non-Gaussian signatures in the primordial density fluc- 
tuations at z ^ 1000 (Komatsu et al. 2003; CoUey & Gott 2003). These results point to two 
important and complementary questions for cosmology in this century: origin and evolution. 
The former regards the question of how our universe came to have the observed values of the 
cosmological parameters including questions of their intrinsic nature. The latter question asks 
how the universe at z ~ 1000 revealed by WMAP leads to the rich current structure traced by 
galaxies. The present study is an attempt to address the latter by characterizing and quantify- 
ing the morphological properties of the galaxy distribution of Sloan Digital Sky Survey (SDSS) 
spectroscopic catalogues through estimation of the Minkowski Functionals (Mecke et al. 1994; 
Schmalzing and Buchert 1997) . 

We have three primary goals for analyzing galaxy catalogues with Minkowski Functionals 
(hereafter MFs): the first is to test Gaussianity of the primordial density fluctuations. Indeed, 
tests of Gaussianity used to be of primary importance before the discovery of the CMB (Cosmic 
Microwave Background) temperature anisotropy, but now have been superseded by the CMB 
map analysis, although measurement of the MFs is still useful as an independent tool. The 
second goal is to understand the evolution of galaxy clustering, with particular emphasis on 
the morphology of large-scale structure. Since the primordial Gaussianity has been fairly well- 
established, studying the growth of structure should be the major role of the MFs, which 
complement estimation of the N-point correlation functions. Finally, we can examine whether 
the current observational catalogues may be regarded as fair samples of the density distribution 
in the Universe. In reality, however, those primary goals may be achieved barely partially 
with the current dataset that we use below. Nevertheless our results shown below should be 
regarded as the most careful attempt towards the goals on the basis of the best observational 
data currently available. 

Focusing on morphological properties has two major advantages. First, there is a firm 
mathematical basis of morphological statistics developed in the field of integral geometry, which 
has condensed the complex information about morphology that is contained in all orders of the 
N-point correlation functions into a set of (i + 1 functionals, where d is the dimension of the 
spatial pattern (Mecke et al. 1994; Kerscher 2000 and references to the mathematical literature 
therein). Second, because this reduced set implicitly contains information from higher-order 
correlation functions, they provide constraints that are complementary to low-order correlation 
function analysis (e.g., Mecke et al. 1994, Schmalzing et al. 1999b). As a consequence of in- 
cluding higher-order correlation terms, the MFs also cover phase-correlations that are essential 
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for the formation of large-scale structure. In particular, the MFs are sensitive to features on 
spatial scales larger than typical scales (~ 5/i~^Mpc) accessible by the two-point correlation 
function, as demonstrated by Kerscher et al. 1998. 

A spatial distribution in three dimensions has four Minkowski Functionals, namely its 
volume, its surface area, the integrated mean curvature and the integrated Gaussian curvature, 
i.e., the Euler characteristic. The latter measures the same property as the Genus (Gott et al. 
1986, Melott 1988) which is a commonly used statistic in large-scale structure analysis. Using 
the whole family of MFs turns out to provide useful additional information: previous experi- 
ence already demonstrated that the total surface area [c.f., Ryden 1988; Ryden et al. 1989) 
and the integral mean curvature are more significant in discriminating structural differences. 
Furthermore, integral geometry provides robust formulae to deal with bounded data sets; in 
fact the boundary effect can only be exactly deconvolved by using the complete set of MFs (e.g., 
Kerscher et al. 1998). Useful 'shapefinders' can be constructed on the basis of isoperimetric 
ratios among the MFs (Sahni et al. 1998, Schmalzing et al. 1999a). 

A natural method to evaluate the MFs of a point-distribution, such as a galaxy cata- 
logue, is the so-called 'Boolean Grain Model', which decorates each point in the sample with, 
in the simplest case, spherical balls; the body formed by the union of these balls at a given 
scale (the radius of the balls) is then measured with MFs (Mecke et al. 1994). This method 
is usually preferable to the construction of a density field by smoothing and measurement of 
the morphology of isodensity contour surfaces, because the former has only one parameter 
(the radius) and does not require a smoothing kernel that discards information. However, the 
Boolean Grain Model has limitations when it comes to analyzing slice-like data such as the 
present sample of the SDSS data set: the largest Boolean Grain (sphere) is the one that fills 
the smallest extent of the sample and therefore the analysis is limited to that scale. For this 
reason we choose to smooth the point distribution with a Gaussian and construct excursion 
sets (isodensity contours) as a sensible way to study the morphology of large-scale structure in 
these data. Because the weak point in this analysis is the reliability of the constructed density 
field on a given smoothing scale, we employ two complementary methods of calculating the MFs 
of isodensity contour surfaces, which introduces a systematic control element into our analysis 
(Schmalzing and Buchert 1997). 

This paper is organized as follows. In Section 2 we explain and illustrate the SDSS data 
set we analyze and describe the mock catalogues used for comparison. In Section 3 we introduce 
the Minkowski Functionals, especially the computational methods of their calculation (details 
are provided in Appendix A). In this section we investigate systematic problems of the analyzed 
sample such as boundary effect, spatial resolution, and smoothing scale. Section 4 is devoted 
to the analysis of volume-limited samples, while the corresponding results for the apparent- 
magnitude limited samples are discussed in Appendix B. Among other issues we address is the 
fair sample question, including a convergence study by comparing with a previous data set. 
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Finally, Section 5 is devoted to a summary of the results and further discussion. 
2. SDSS Galaxy and N-body Mock Catalogues 
2.1. SDSS Sample 

York et al. (2000) provide an overview of the SDSS. Stoughton et al. (2002) describe the 
Early Data Release (EDR) and details about the measurements. Technical articles providing 
details of the SDSS include the description of the photometric camera (Gunn et al. 1998), 
photometric analysis (Lupton et al. 2002; Stoughton et al. 2002), the photometric system 
(Fukugita et al. 1996; Hogg et al. 2001; Smith et al. 2002), astrometric calibration (Pier et 
al. 2002), selection of the galaxy spectroscopic samples (Strauss et al. 2002; Eisenstein et al. 
2001), and spectroscopic tiling (Blanton et al. 2002). 

Our analysis of the MPs in the present paper is based on a subset of the SDSS galaxy 
redshift data, 'Large-scale Structure Sample 12' (Blanton et al. 2002). This sample includes 
galaxies with r-band magnitudes between 14.5 and 17.5 after correction for Galactic reddening 
using the maps of Schlegel, Pinkbeiner, & Davis (1998). 

A map of the galaxy distribution of the data that we analyze, together with a typical 
slice, are shown in Figures 1, 2 and 3. The three-dimensional map centered on us in equatorial 
coordinate system is shown in the upper panel of Figure 1. The lower panel in this figure shows 
the map projected onto the celestial sphere. The projected skymap for each Region is plotted 
in the survey coordinate system in Figure 2. Redshift slices of galaxies centered around the 
equatorial plane with various redshift limits and thicknesses of planes are shown in Figure 3: 
z < 0.05 with thickness of 10/i~^Mpc centered around the equatorial plane in the upper-left 
panel; z <0.1 with thickness of 15/i~^Mpc in the upper-right panel; z < 0.2 with thickness of 
20/;.~^Mpc in the lower panel. 

The data are mainly located in three regions, which we call Region 1, 2 and 3, respec- 
tively. The properties of each region covering the range of right ascension a and declination S 
of the equatorial coordinate system, and the number of galaxies are listed in Table 1. SDSS 
photometry is taken in driftscan mode along great circles along arcs of constant r] in the survey 
coordinate system (A,?]). The survey coordinate system (A,//) is suitable to describe the survey 
area, because each drift scan is perpendicular to a line of constant fi. The transformation 
between celestial and survey coordinates is 

cos(a; — 95°) cos 5 = — sin A 

sin(a — 95°) cos5 = cos Acos(?7 + 32.5°) (1) 
sin 6 = cos A sin(?7 + 32.5°) . 

In this paper we do not present the results of our analysis of the data located in Region 3 
(Southern sky) containing the three 2.5° width consecutive stripes separated from each other. 
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The reason is that no additional information on the morphology of three-dimensional structure 
could be deducted from this set due to its slice-like geometry. We also do not use the data in 
the discrete area in Region 2 (22.8° < A < 36.1°, 46.2° < f] < 51.3°) containing 3056 galaxies 
(Middle panel in Figure 2). 

Table 1. Properties of three main regions of the SDSS 'Sample 12', including the range of each region in the equatorial 
coordinate system (a, 5) and the survey coordinate system (A, 77) (Eq.[l]). 



Name 


a 




S 




A 






V 




Region 1 


129.7°- 


250.0° 


-3.74° - 


-6.25° 


-55.2° - 


-65.0 


323.7° 


- 333.8° 


55897 


Region 2 


112.0°- 


260.6° 


23.1° - 


68.8° 


-59.8°- 


' 60.4° 


16.2° 


-51.3° 


59766 


Region 3 


308.7° - 


^63.3° 


-11.3° - 


- 16.3° 


-57.7°- 


'55.8° 


130.6° 


- 159.9° 


38624 



Fig. 1. Upper: 3D rcdshift-space map centered on us, and its projection on the celestial sphere of SDSS 
galaxy 'Sample 12', including the three main regions listed in Table 1. Lower: Projected skymap of 
'Sample 12' galaxy data in equatorial coordinates {o:,S). 

Fig. 2. Upper: Projected skymap of 'Sample 12' galaxy data in the survey coordinate system (A, 77) in 
Region 1 {Top), Region 2 {Middle) and Region 3 {Bottom). 

2.2. Mock samples 

To test for several observational effects on the MFs, including the shape of the survey 
volume and the redshift distortion, we construct mock galaxy samples from a series of P^M A^- 
body simulations provided by Jing and Suto (1998). These simulations employ 256'^ particles 
m a (300/i-^Mpc)3 periodic comoving box using Gaussian initial conditions and a Cold Dark 
Matter (CDM) transfer function (Bardeen et al. 1986). We use the z = snapshot simulation 
data (for simplicity we neglect the light-cone effect) in two CDM cosmological models: Lambda- 
CDM (LCDM) with Qq = 0.3, Aq = 0.7, h = 0.7, T = 0.21, and (Xg = 1, and Standard-CDM 
(SCDM) with f2o = I7 Aq = 0, h = 0.5, T = 0.5, and as = 0.6, where h denotes the Hubble 
constant in units of 100km s~^Mpc~^, F is the shape parameter of the transfer function, and cTg 
is the r.m.s. density fluctuation amplitude at 8/i~^Mpc. To simulate the effect of the shape of 
the survey volume, we extract wedge samples (12 realizations in total) out of the full simulation 
cube so that they have the same sample-shape and number of particles as each volume-limited 
sample. To construct mock samples that extend beyond the simulation box size, we duplicate 
particles using the periodic boundary conditions. We also consider the redshift distortion effect 
by adding the line-of-sight component of the peculiar-velocity to each particle in the calculation 
of the redshift. 
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Fig. 3. Redshift slices of 'Sample 12' galaxy data around the equatorial plane. The rcdshift limits and 
the thickness of the planes are: Upper-left z < 0.05, 10/i~^Mpc; Upper-right z < 0.1, 15ft-^^Mpc; Lower 
z < 0.2, 20/i~^Mpc. The size of points has been adjusted. Note that the data for the Southern part are 
sparser than those for the Northern part, especially for thick slices (see Figure 1). 

3. The Minkowski Functionals (MFs) 

3.1. Mathematical Aspects of Minkowski Functionals 

The morphological properties of an n-dimensional pattern are completely described in 
terms of n + 1 quantities, which we call MFs. In the present analysis, we generate isodensity 
contours from the three-dimensional density contrast field 6 by taking its excursion set F^, 
i.e., the set of all points where the density contrast 6 exceeds the threshold level u. The four 
Minkowski Functionals Vk{iy) of the excursion set can be measured, and plotted as functions of 
the threshold u. All MFs can be interpreted as well-known geometric quantities, namely the 
volume fraction Vo(z^), the total surface area Vl(i^), the integral mean curvature ¥2(1'), and the 
integral Gaussian curvature, i.e., the Euler characteristic Vslu). 

All MFs can be expressed as integrals over the excursion set. While the first MF is 
simply given by the volume integration of a Heaviside step function 6 normalized to the total 
volume Vtot, 

Vo{u) = -^ [ d^xe{u-iy{x)), (2) 



V'tot Jv 

the other MFs, Vk{k = 1,2,3), are calculated by the surface integration of the local MFs, v^^'^ 
(Schneider 1978). The general expression is 

Vu{u) = ^f d'S{x)v'^\v,x), (3) 

Vtot JdF^ 

with the local Minkowski Functionals for k = 1,2,3 given by 

1 



6' 

1/1 1 



(4) 
(5) 



where Ri and R2 are the principal radii of curvature of the isodensity surface. 

For a 3-D Gaussian random field, the average MFs per unit volume can be expressed 
analytically as follows (Tomita 1990): 

V,{u) = i - Texp ( \dx, (7) 



2 V27r ^0 
2 A 




V,iu) = l^exp{-;-u'] (8) 
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V2{u) = l^uexp(-lu^, (9) 



3V27r V 2 

Vs{u) = ^{u'-l)exp[^-lu'], (10) 



where A = ^^Jaf/Qna'^, a = (5^)^/^, cti = (|V(5p)^/^, and 5 is the density contrast. 

In this paper we evaluate the MFs as a function of the threshold level. We employ two 
different definitions of this threshold: one is the density threshold level denoted by u^, which 
is given by the density contrast 6 divided by the r.m.s. density fluctuation a after smoothing, 
the other is labeled by z/f parameterizing the volume-fraction / (Gott et al. 1986), 

/ = ^ / e-"'/2rfx. (11) 



/27r. 

The meaning of each definition is discussed in the section of the results for the MFs with 
volume-limited samples (§4). 

3.2. Computational Methods 

We compute the MFs for density contrast fields from mock samples and SDSS galaxies. 
We use the cloud-in-cell (CIC) interpolation to assign survey galaxies and dark matter particles 
to densities defined on a 128 x 256 x 64 grid. We Fourier transform the density contrast field, 
multiply by the Fourier transform of a Gaussian window with smoothing scale Rq, 

WG(r) = ^i expf — ^1, (12) 

and then transform back to real space. 

The accuracy of the estimation of the MFs increases in a specific volume as we choose 
smaller values of Rq, because the amplitudes of the MFs are roughly proportional to the number 
of structures with typical scale Rq. On the other hand, in order to produce reliable results of 
the MFs, Rq should be bounded by a minimum value for each sample to satisfy the following 
criteria (Hoyle et al. 2002): 

1. Rq should be comparable to or larger than the mean separation of galaxies (using too 
small values of Rq reduces to spherical isodensity contours around each galaxy.) 

2. The r.m.s. density fluctuation at the smoothing scale of -Rg should be larger than the 
Poisson term, which is equal to the inverse square root of the galaxy number within a 
Gaussian ball of effective radius -Rq. 

3. Rq should be more than twice the mesh size, corresponding to the Nyquist resolution 
frequency. 

A useful way to check for systematic errors due to numerical approximations inherent 
to each numerical code as well as due to the influence of the survey boundary is to compare 
the results with another code based on a different computational method. There are several 
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interesting routines to estimate the MFs, (e.g., Sheth et al. 2003), however, we here use two 
well-studied complementary routines to compute the MFs of a grid density field. 

The first approach transforms the surface integrals in Equations (3) into volume integrals. 
The local Minkowski Functionals are expressed in terms of invariants formed from the first and 
second derivatives of the density contrast field (c/. Schmalzing & Buchert 1997 for an outline 
of this approach and Koenderink 1984, and Appendix A for the gory details). Hereafter we call 
this approach Koenderink invariants. 

The other routine is based on Crofton's formula (Crofton 1868) from integral geometry 
(Hadwiger 1957). A more detailed description of the method can be found in Schmalzing & 
Buchert 1997. This method, which we will refer to as Crofton's formula, is best suited for 
a pattern described as a set of empty and occupied cells of a cubic grid. The calculation of 
Minkowski Functionals then reduces to counting the elementary cells (in three dimensions, the 
points, lines, squares, and cells of the cubic lattice), that are occupied by the pattern. 

For the computation of the fourth MF V3, we also apply the CONTOUR 3D code 
(Weinberg 1988), which has been widely used to calculate the genus g = — V3 (e.g., Gott et al. 
1989). This code computes the integrated Gaussian curvature over the surface by considering 
the dataset to consist of cubic pixels, so the contour surface consists of faces, edges and vertices, 
and where all the curvature is concentrated at the vertices. The curvature integral is then 
calculated by summing the angle deficits (and subtracting the angle excesses) at the vertices. 

Although the above three methods are theoretically equivalent, the numerical results 
are not always the same. This is partly due to the finite cell size of the grid used for our 
computations. However, the main difference comes from the effect of the complicated shape 
of the survey boundaries. In the following analysis we test the effect of excluding part of the 
volume near the boundary from our analysis, to find the limited spatial region in which the 
results from all three methods are consistent with each other. 

3.3. Log-normal model as a standard of reference 

The genus for the distribution of dark matter in simulations has been found to be well- 
approximated by the log-normal model (Matsubara and Yokoyama 1996; Hikage et al. 2002). 
Therefore, we might expect that this model would provide a reasonable approximation to all 
of the MFs of the dark matter distribution. Matsubara and Yokoyama (1996) derived the 
genus expression assuming that the nonlinear density field of dark matter has a one-to-one 
correspondence to the primordial Gaussian field. Generalizing their formula for the genus, the 
expressions for MFs that obey the log-normal statistics are obtained by substituting z/ln and 
Aln for and A in Equations (7) to (10): 



\n[{l + u^a)VTT^] 



(13) 
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( 



a 



) 



1/2 



X. 



(14) 



Figure 4 shows the comparison between the log-normal model and the MFs of mock sim- 
ulations estimated from both the Koenderink invariants and Crofton's formula methods at 
-Rg = 3, 7, 20/i~^Mpc. All of MFs except for Vq are multiplied by the volume of a Gaussian 
ball at the radius of Rq, A/SttRq. The error bar represents the statistical error estimated from 
three realizations of the full simulation. We find that all MFs obtained with the two methods 
agree very well over a wide range of smoothing scales. The log-normal model nicely reproduces 
all of the four MFs for dark matter and thus the difference between the log-normal model and 
the MFs for dark matter is negligible in the analysis of our galaxy samples with 10^ ~ 10'^ times 
smaller volume (see Table 2) than that of the full cubic data. 

4. Analysis of Volume— limited Samples 

4.1. Construction of Volume-limited Samples 

In an apparent-magnitude limited catalogue of galaxies, the average number density of 
galaxies decreases with distance because only increasingly bright galaxies are included in the 
sample at larger distance. To avoid this systematic change in both density and galaxy luminosity 
we construct volume-limited samples of galaxies, with cuts on both absolute-magnitude and 
redshift (we do not attempt to correct for evolution in the galaxy population over these limited 
ranges of redshift). In this section we mainly study the dependencies on galaxy luminosity 
and morphological type of the MFs for the volume-limited samples, and compare with the 
corresponding mock catalogues. We construct six volume-limited samples in Region 1 and 
Region 2 covering ranges of absolute magnitude of width AMj. = 1 from —23 to —17, where the 
absolute magnitude is 



and where K{z) is the K-correction for the bandpass shift due to the redshift. We apply an 
approximate K-correction factor K[z) = 0.9z valid for the typical galaxy color of g — r = 0.65 
(Fukugita et al. 1995). The maximum (minimum) redshift is determined by the redshift at 
which the faintest (brightest) galaxies in each sample lies in the apparent-magnitude range 
14.5 < mj. < 17.5 (see Figure 5). The bright cutoff in this apparent magnitude range approxi- 
mates the three arcsecond aperture magnitude limit in the spectroscopic target selection (see 
Strauss et al. 2002 for details). We compute the comoving distance from the observed redshift 
z of each galaxy after correction for the Local Group motion. 



where is the matter density parameter and Aq is the dimensionless cosmological constant. 
Table 2 shows the redshift range and the total volume of each volume-limited sample. 



M, = m,- 51og[r(l + z)] + K{z) , 



(15) 




(16) 
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Fig. 4. Comparison of the MFs for dark matter using all particles in N-body simulations ((300/i~^Mpc)^) 
run by Jing & Suto (1998) with predictions of the log-normal model. Here we show estimates of the MFs 
using two methods: Crofton's formula labeled by (C) at Rq = 3/i^^Mpc (filled circles), 7/i~^Mpc (open 
triangles), and 20/i~"'^Mpc (crosses), and the Koenderink invariants labeled by (K) at each smoothing 
scale (solid lines). The log-normal model predictions (Eq: (13)) are plotted with dashed lines at each 
smoothing scale. All of MFs except for Vq are multiplied by the volume of a Gaussian ball at the radius 
of Rq, A/SttRq. Error bars estimated from three realizations of cubic data are shown on the results from 
Crofton's formula. 
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-23 
-22 
-21 
-20 
-19 
-18 
-17 

0.1 0.2 0.3 

Z 

Fig. 5. Schematic picture to show how we determine the redshift range of a volume-Umited sample with 
a given absolute-magnitude range. Six boxes represent the absolute-magnitude ranges and the redshift 
ranges of volume-limited samples listed in Table 2. Solid lines indicate the relation between and z 
(Eq. (15)) at apparent magnitudes m,- ~ 14.5 and 17.5. 

Table 2. Redshift range and survey volume in Regions 1 and 2 for each volume limited-sample with different magnitude 
ranges. 



sample name 


magnitude range 


redshift range 


total volume [{h ^Mpc)^] 
Region 1 Region 2 


A 


-23 < M- < 


-22 


0.098<z<0.218 


1.62 X 10^ 


1.86 X 10^ 


B 


-22 < Mr < 


-21 


0.064<z<0.148 


5.40 X 10^ 


6.17 X 10^ 


C 


-21 < A/,. < 


-20 


0.042 <z< 0.098 


1.66 X 10^ 


1.90 X 10^ 


D 


-20 < A/r < 


-19 


0.027<z< 0.064 


4.81 X 10^ 


5.50 X 10^ 


E 


-19 < A/r < 


-18 


0.017<2<0.042 


1.33 X 10^ 


1.52 X 10^ 


F 


-18 < A/,. < 


-17 


0.011 <z<0.027 


3.57 X 10-* 


4.08 X lO'' 



These galaxies are further classified into two morphological types, Early-type and Late- 
type, which basically correspond to E/SO and Sp/lrr, respectively. Shimasaku et al. (2001) 
found a tight correlation between the morphology and the inverse concentration index C[^, de- 
fined as the ratio of the half-light Petrosian radius to the 90 %-light Petrosian radius (Stoughton 
et al. 2002). We adopt the threshold values of = 0.35 for < 16.0, 0.359 for 16.0 < < 16.5, 
and 0.372 for 16.5 < rrij. < 17.0. Since the discrimination of morphology is very difficult for 
rrij. > 17.0, we count such faint galaxies neither as Early-type nor Late-type (this is why the 
number of galaxies for early and late types is different from that of all galaxies in Table 3). Table 
3 shows the properties of each volume-limited sample including the numbers and the mean sep- 
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Table 3. Properties of volume-limited samples with different absolute-magnitude range, including the number of galaxies 
and the mean separation for All, Early-type and Late-type galaxies, respectively. 



sample name 


galaxy type 


number of 
Region 1 


galaxies 
Region 2 


mean separation [h ^Mpc] 
Region 1 Region 2 


A 


All 


853 


907 


ZD. ( 


07 A 




Early 


592 


645 


on 1 
3U.1 


on T 




Late 


151 


131 


47.5 


coo 
OZ.Z 


B 


All 


6727 


6286 


y.o 


n o 

y.y 




Early 


3570 


3221 


ii.O 


1 O /I 




Late 


1938 


1699 


"1/11 

14. i 


15.4 


C 


All 


9783 


7750 


ti.O 






Early 


4049 


3178 


(A 


O A 




Late 


3849 


3276 


n a 
I X> 


o o 
o.o 


D 


All 


3677 


4844 


^ 1 
O.i 


A Q 

4.0 




Early 


987 


1256 


/ .y 


I .0 




Late 


2099 


2644 


6.1 


5.9 


E 


All 


1648 


1671 


4.3 


4.5 




Early 


245 


253 


8.2 


8.4 




Late 


1071 


1073 


5.0 


5.2 


F 


All 


686 


416 


3.7 


4.6 




Early 


63 


37 


8.3 


10.3 




Late 


422 


258 


4.4 


5.4 



arations for Early-type, Late-type and All (including the faint galaxies with 17.0 Krrij. < 17.5) 
in each region. 

Following the criteria for the smoothing scale Rq discussed above, we choose the three 
appropriate volume-limited samples and analyze them with smoothing lengths Rq = 3, 5, 7 and 
10/i~^Mpc, respectively, listed in Table 4. The amplitude of the the MFs for each smoothed 
density field varies with the effective number of resolution elements in the sample iVres (Vogeley 
et al. 1994): 

iV,es = Kot/(27r)3/2i?3^ , (17) 

where Vtot is the total survey volume in which we compute the MFs. The estimator iVres 
indicates the number of structures at a typical scale Rq in the smoothed field and is roughly 
proportional to the amplitude of the MFs (the sampling noise of the MFs is roughly proportional 
to the inverse square root of A^res)- We also list the value of A'rcs for each smoothed density 
field in Table 4. 

We calculate the r.m.s. fluctuation amplitude a for each smoothed density field. Table 
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Table 4. Properties of density fields for each volume-limited sample including the smoothing scale Rq, the abso- 
lute-magnitude range, the mesh size of the simulation box, the resolution number A^ros (Eq- (17)), and the r.m.s. density 
fluctuation amplitude cr of the SDSS galaxy number density field and the averaged cr with one-sigma error of the particle 
number density field in the mock samples. 





Kg 


sample 
name 


mesh size 


Region 1 Region 2 


a of SDSS 
Region 1 


galaxies 
Region 2 


cr of mock samples 
Region 1 Region 2 


3h- 


"1 Mpc 


D 


1.7 h~^Mpc 


1131 


1294 


1.46 


1.37 


1.38 ±0.06 


1.38±0.13 






E 


1.1 /i-^Mpc 


312 


357 


1.31 


1.26 


1.28±0.13 


1.38±0.16 






F 


0.7/1-1 


83 


95 


1.11 


1.23 


1.47±0.54 


1.68 ±0.58 


5h- 


"1 Mpc 


C 


2.6 /i-^Mpc 


843 


965 


1.13 


1.10 


0.94 ±0.03 


1.00 ±0.07 






D 


1.7 h-^Mpc 


244 


279 


1.02 


0.96 


0.94 ±0.04 


0.95 ±0.09 






E 


1.1 h-^Mpc 


67 


77 


0.91 


0.88 


0.89 ±0.10 


0.94±0.10 


7h- 


-1 Mpc 


C 


2.6 h-^Mpc 


307 


351 


0.93 


0.86 


0.72 ±0.03 


0.77 ±0.05 






D 


1.7 h-^Mpc 


89 


101 


0.77 


0.73 


0.70 ±0.04 


0.73 ±0.07 






E 


1.1 /i-^Mpc 


24 


28 


0.71 


0.69 


0.70±0.10 


0.71 ±0.07 


lOh 


"1 Mpc 


B 


3.8 h^^Mpc 


342 


391 


0.69 


0.77 


0.58±0.02 


0.62 ±0.03 






C 


2.6 /i-^Mpc 


105 


120 


0.74 


0.64 


0.53 ±0.02 


0.58 ±0.04 






D 


1.7 h-^Mpc 


30 


34 


0.56 


0.53 


0.51 ±0.04 


0.54 ±0.06 



4 shows that a of the SDSS galaxies agrees well with that of the dark matter samples within 
the standard deviation for all smoothing scales and samples with different magnitude ranges. 
This implies that the current data of SDSS galaxies may already approach a fair sample with 
respect to two-point measures on scales < 10h~^Mpc. We also find that luminous galaxies 
exhibit stronger clustering (larger a). 

Fig. 6. top: Illustration of a volume-limited sample C. Upper figures are redshift slice diagrams of 
Early-type (red), Late-type (blue), and Faint (17.0 < < 17.5) galaxies (black) for Region 1 (left) 
and Region 2 (right). Lower panels show redshift-space maps of the smoothed density field of the vol- 
ume-limited sample for All, Early-type and Late-type galaxies at Rq ~ Th^^Mpc. 

Figure 6 shows projected point distributions and contour plots of the density fields of 
the volume-limited samples smoothed at Rq = 7/i~^Mpc for Early-type, Late-type, and All 
galaxies, respectively (see discussion in Subsection 4.4). 

Differences between estimates of the MFs using Crofton's formula and the Koenderink 
invariants methods is shown in Figure 7. The methods yield consistent results except for V2. In 
that case the difference is minimized to an acceptable level by restricting the region of analysis 
to a minimum of two grid cells from the boundary. 

Here, two comments are in order: first, V2 is, according to many previous studies, the 
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Fig. 7. Comparison of tiie two mctliods used to compute the MFs of mock volume-limited samples of 
Region 1 . The plotted lines show the difference between the MF estimated using the Koenderink invariants 
method and the Crofton's formula method, normalized by the MF amplitude predicted by the log-normal 
model. Error bars indicate the standard deviation estimated from the mock samples. The region near 
the survey boundary that is excluded from the analysis varies from from zero (dotted lines), to one 
grid cell (dashed hues) or two grid cells (solid lines) from the survey edge. For the fourth MF, V-i{va), 
the difference between results from the CONTOUR 3D code (Weinberg 1998) and Crofton's formula is 
plotted with dashed lines. The smoothing scale is i?G = S/i^^Mpc, and the absolute-magnitude range is 
-20<A/r<-19. 
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most sensitive of the four MFs to the morphology of large-scale structure. When the survey 
of the north galactic cap is completed, V2 will be accurately measured. Second, we found that 
the agreement between both methods is better for all MFs at smaller smoothing lengths. For 
Vsi^^), which is the same as the genus except for the sign, we also plot the difference between 
the Crofton's formula results and the genus calculated by CONTOUR 3D (Weinberg 1998). 
The agreement between the two methods is also satisfactory. Therefore, we show only the 
estimation of the MFs based on Crofton's formula for all MFs in what follows. In order to 
effectively enlarge the analyzed sample region and so improve on the significance of the results, 
we averaged the values for the MFs for Region 1 and Region 2. In doing so, we also take into 
account cosmic variance, to which we devote a separate study in the next subsection. 

4-2. Cosmic Variance 

To examine whether Region 1 and Region 2 are fair samples for the purpose of measuring 
the MFs, we plot the difference between the observed MFs, V"obs,fe, and MFs from mock samples 
Knock,fc for Region 1 and Region 2 simultaneously in Figure 8. We normalize the difference by 
the Vln.i('^), which is the second MF predicted by the log-normal model (Eq. (13)) to remove 
the exponential damping nature of the MFs for large thresholds. The error bars on the line 
indicate the statistical error estimated from 12 realizations of the mock samples for each region. 

We also compare the results for 'Sample 12' (lower panel in Figure 8) with the results 
of the previous 'Sample 10' (upper panel) which contains ~ 30% fewer galaxies and has ~ 20% 
smaller survey volume than 'Sample 12'. We find that all of the four MFs marginally agree 
within the statistical error between Region 1 and Region 2 for 'Sample 12', while significant 
morphological fluctuations are seen in the MFs for Regions 1 and 2 of 'Sample 10'. 

This "convergence property" of our results by moving from 'Sample 10' to 'Sample 12' 
supports our methods of analysis and indicates that we obtained statistically reliable results. 
In the case of 'Sample 12' the variation of the MFs between Regions 1 and 2 is consistent with 
the uncertainties due to cosmic variance estimated using the mock samples. However, 'Sample 
10' still displays significant morphological discrepancies. As we learned from previous studies of 
the PSCz catalogue (Kerscher et al. 2001) compared with the IRAS 1.2 Jy catalogue (Kerscher 
et al. 1998; 1997), morphological fluctuations on these spatial scales can be large, and the 
'fair sample scale' may well be larger than 300/i~^Mpc. Our previous studies mentioned above 
also showed that the sample-to-sample variation in mock catalogues critically depends on the 
size of the simulation box; simulations with box-size of the mock catalogues we employed in 
the present investigation are not expected to be able to reproduce the large morphological 
fluctuations that could be present on the scales considered. In any case, this is one of the 
critical issues that can be answered more reliably with the future samples of SDSS galaxies. 
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Fig. 8. Comparison of the MFs for the volume-limited samples of SDSS galaxies, Region 1 (solid lines) 
and Region 2 (dashed lines). We plot the difference between the MFs of the observations and moek surveys 
of eaeh region, normalized by the amplitude of eaeh MP predicted from the log- normal model for 'Sample 
10' (upper panel) and 'Sample 12' (lower panel). Error bars indicate the standard deviation estimated 
from the mock results. The smoothing scale is Rq ~ 10/i~^Mpc, and the absolute-magnitude range is 
-20<A/r <-19. 
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Fig. 9. Comparison of the MFs of volume-limited samples with different absolute-magnitude ranges 
(open triangles, filled circles and crosses in decreasing orders of M,-). In the right set of panels, the lines 
represent the difference between the observed MFs and results for the mock samples from simulations, 
normalized by the amplitude of each MP predicted from the log-normal model. Sampling errors estimated 
from mock samples are plotted with the observational results. Averaged MFs of the mock samples are 
plotted for LCDM (solid lines) and SCDM (long dashed lines). Log-normal model predictions are also 
plotted with short dashed lines. Plotted lines are the average between Region 1 and 2 at Rq = 5h~^Mpc. 
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4-3. Luminosity Segregation 

In Figure 9 we plot the four MFs for the volume-hmited samples at Rq = 5/i^^Mpc 
with three different absolute-magnitude ranges, —21 < Mr < —20 (Sample C), —20 < Mj. < —19 
(Sample D), and —19 < Mr < —18 (Sample E) for Region 1 and 2 together. The properties 
of each volume-limited sample are listed in Table 4. The error bars on each observed MP 
represent the standard deviation estimated from the mock samples. For comparison we plot 
the averaged MFs for the mock samples with intermediate absolute-magnitude range, that is. 
Sample D for both LCDM and SCDM models. The analytical prediction of the log-normal 
model (Eq. (13)) is also plotted with a and ai calculated from the fitting formula of the power 
spectrum of the LCDM model by Peacock & Dodds (1996). 

Figure 9 addresses three issues: first, the comparison between the log-normal model and 
the simulated MFs, which corresponds to the MFs for dark matter, second, the comparison 
between the simulated MFs and the observed MFs, and third, the comparison among the 
observed MFs of galaxy subsamples of different luminosity. First we find that the log-normal 
predictions agree well with the MFs for dark matter, even though the shape of the volume is 
wedge-like. This means that the observational effects on the MFs, such as the survey shape and 
the redshift distortion, are negligible, at least in the samples that we analyze. These results are 
consistent with previous results on the genus analysis for the SDSS Early Data Release (Hoyle 
et al. 2002; Hikage et al. 2002). 

Next, we find that the observations agree better with the mock results of the LCDM 
model than with those for the SCDM model, for all of the MFs at all smoothing scales consid- 
ered. Assuming a scale-invariant linear bias between dark matter and galaxies, the agreement is 
consistent with the A-dominated spatially-flat model with random-Gaussian initial conditions. 

Finally, the luminosity dependence of the MFs is found to be small compared with the 
standard deviation. We plot the difference between the MFs of the observation and the mock 
samples (subtraction of the mock results from the observed MFs at each u) in the right panels 
of Figure 9. We find that there is no clear difference between the MFs for galaxy samples with 
different luminosities. Our results suggest that the sensitivity of the nonlinearity of the biasing 
to the galaxy luminosity is small, in other words, the biasing effect of galaxy luminosity is 
negligible, at least on the volume-scale of 'Sample 12'. 

Figure 10 shows the MFs as a function of Uf defined from the volume fraction (Eq. [11]). 
Instead of the log-normal model prediction, the analytical predictions in random-Gaussian 
statistics are plotted. All of our results favor the LCDM model with random-Gaussian initial 
conditions. 

4.4- Morphological Segregation 

In this subsection we perform a quantitative analysis of the dependencies of the MFs on 
morphological type of galaxies (c/. Figure 6). Similar to the analysis of luminosity segregation 
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Fig. 10. MFs as a function of ft for Rq = 5h ^Mpc in Region 1 and Region 2 together. (Compare Figure 
9 for the MFs as a function of i^o-). 
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Fig. 11. Comparison of tlic MFs of a volume-limited sample C of Early-type galaxies (open trian- 
gles), Late-type galaxies (crosses) and All galaxies (filled circles). Plotted lines are the average be- 
tween Region 1 and Region 2 smoothed at Rq ~ 5/i~^Mpc. The lower panel displays the difference 
between the observed and mock sample MFs, normalized by the amplitude of each MF predicted from 
the log-normal model. Sampling errors estimated from mock samples are plotted with the observa- 
tional results. Averaged MFs of the mock samples for the distribution of all galaxies based on LCDM 
are plotted with solid lines. Log-normal model predictions are also plotted with short dashed lines. 
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in the previous subsection, we here compare the MFs for a volume-hmited sample, Sample 
C, smoothed at 5/i~^Mpc for different morphological types: Early-type, Late-type, and All 
galaxies shown in Figure 11. For comparison we plot the MFs for dark matter based on the 
LCDM and SCDM models, as well as the log-normal predictions. 

In the lower panel of Figure 11, which shows the difference between the observed MFs 
and simulated MFs (the same as the lower panel of Figure 9), we find that the variations of 
the MFs for different galaxy type is small enough to conclude that morphological biasing of the 
MFs is negligible in 'Sample 12'. 

5. Summary and Discussion 

We present results of a detailed morphometric analysis of a sample of SDSS galaxy data 
containing 154844 galaxies ('Sample 12') employing the complete set of Minkowski Functionals 
(MFs). To test for systematic errors in calculating the MFs of isodensity surfaces, we employ 
two complementary computational methods for the MFs: Crofton's formula and Koenderink 
invariants. We find that these methods produce results that agree to within statistical errors, 
despite the complicated shape of the survey region. A possible exception to this agreement 
occurs for the most sensitive MF, V2, but even in this case the dependence on method becomes 
small when the computed region is limited to several pixels inward from the survey edge. 
By constructing volume-limited samples, in which the distribution of galaxies in predefined 
magnitude ranges is homogeneous, we conduct a detailed analysis of the observed MFs for 
galaxies within different magnitude ranges and for galaxies of different morphological type. We 
compare these results with those estimated for the MFs of mock samples drawn from N-body 
simulations. 

Let us summarize our results according to the three primary goals mentioned in 
Introduction. 

1) Primordial Gaussianity: The good match between the observed MFs and the mock 
predictions based on the LCDM model with the initial random-Gaussianity might be inter- 
preted to imply that the primordial Gaussianity is confirmed. A more conservative interpreta- 
tion is that, given the size of the estimated uncertainties, these data do not provide evidence for 
initial non-Gaussianity, i.e., the data are consistent with primordial Gaussianity. Unfortunately 
due to the statistical limitation of current SDSS data, it is not easy to put more quantitative 
statement concerning the initial Gaussianity, but this is definitely what we intend to conduct 
with the improved datasets available in near future. 

Moreover, in order to go further and place more quantitative constraints on primordial 
Gaussianity with upcoming data, one needs a more precise and reliable theoretical model for 
the MFs which properly describes the nonlinear gravitational effect possibly as well as galaxy 
biasing beyond the simple mapping on the basis of the volume fraction. A perturbative approach 
by Matsubara (1994) combined with an extensive simulation mock sample analysis may be a 
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promising strategy for this purpose. 

2) Evolution of galaxy clustering: Galaxy biasing is another source of uncertainty for re- 
lating the observed MFs to those obtained from the mock samples for dark matter distributions 
(e.g. Hikage et al. 2001). If LCDM is the correct cosmological model, the good match of the 
MFs for mock samples from the LCDM simulations to the observed SDSS MFs may indicate 
that nonlinearity in the galaxy biasing is relatively small, at least small enough that it does 
not significantly affect the MFs (The MFs as a function of remain unchanged for the linear 
biasing). Furthermore, we show that the dependence of the nonlinearity of the biasing on the 
luminosity and the morphological type of galaxies is also very small. Our observational results 
that the effect of the galaxy biasing on the MFs is small is consistent with the predictions of 
theoretical studies (e.g. Benson et al. 2001). 

3) Fair sample hypothesis: The observed morphological fluctuations inferred from the 
comparison of Region 1 and Region 2 of 'Sample 12' are indeed consistent with those exhibited 
in our mock catalogues. 

Thus our overall conclusion is that the simulations of the LCDM model with primordial 
random-Gaussianity reproduce the observed MFs within the statistical errors. Differences 
between the MFs for the mock samples drawn from simulations of the LCDM and SCDM 
models arise mainly from their power spectra. The present MFs' analysis places constraints on 
the cosmological models that are consistent with previous results using the two-point correlation 
function, but does not yet exploit the full advantage of the MFs as a complementary statistic 
to the conventional two-point correlation function, at least on the level of 'Sample 12'. 

While we found that the dependence of the MFs on the luminosity and the morphological 
type of galaxies is weak and both distributions are comparable within the statistical error, we 
believe that the systematic differences due to these galaxy properties (as suggested for the 
genus statistic by Hoyle et al. 2002) will certainly be detectable in the near future. 
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Appendix A: Calculating higher-order Minkowski Functionals from the Koenderink 
invariants of a scalar field 



As stated in the main text, the higher-order Minkowski Functionals V^, fi = l,...,d oi 
any pattern P G Q in d dimensions can be calculated by integrating the partial Minkowski 
Functionals f^(x) along the surface of the pattern dP: 

V,= [ d'-'Sv.i^), (18) 

JdP 

where d'^~'^S is the c?— 1-dimensional surface element at x. Unfortunately, this formula is of little 
practical use for evaluating the Minkowski Functionals of a pattern P^, given as the isocontour 
of a scalar field u(x), that is 

P, = {x|n(x) >z/}. (19) 

We aim to convert the surface integrals above into volume integrals of functions of the field and 
its derivatives. 

It is relatively simple to convert the surface integral itself; we have 

/ d'^-'S= f d'^x\\/u(x)\5(u(x)-u), (20) 
JdP,, Jn 

where Vm(x) is the gradient of the field and 6{u{x.) — u) is a. Dirac (5-function selecting only the 
surface of the pattern from the whole support Q. 

As far as the local Minkowski Functionals t'^(x) are concerned, we will restrict ourselves 
to the relevant case of three dimensions. Then, the local Minkowski Functionals are proportional 
to the mean and Gaussian curvatures, H and G, of the surface: 

^i(x) = ^ (21) 

v,{^) = ^H (22) 
Sir 

^i(x) = i-G. (23) 

In order to express the curvatures in terms of derivatives, we perform some textbook differential 
geometry. The surface around a point x can be parameterized locally with two parameters ta, 
a = 1,2. The isocontour of the field m(x) at threshold u is characterized by a simple implicit 
equation: 

z/ = n(x). (24) 

Taking the derivative of this equation with respect to the surface parameters ta yields a system 
of equations that allows us to determine the fundamental forms of the iso-surface. We have^: 

^ In the following, ordinary partial derivatives with respect to a coordinate in three dimensions are denoted 
by the index of the coordinate following a comma, while derivatives along the surface are denoted with 
the index of the surface parameter following a semicolon. Indices named a and b are used for the surface 
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= U.a = U,iXi-a, and = U-ab = U^ijXi-aXj-b + U^iXi-ab- (25) 

The first part of tlie system, 

= U^iXi-a, (26) 

can be solved to yield the tangent vectors x.^, a = 1,2 of the surface. We will use^ 

The scalar products of the tangent vectors form the entries of the first fundamental form of the 
surface, the metric tensor gab- We have 

gab = Xi-aXi-b = SabU,iU,i " U,aU,b- (28) 

In the following, we need the determinant g of the metric tensor, given as 

g = det gab = u'^^u^iU^i (29) 
and its inverse 

, 6abU\ + U aUh , , 

^ab ^ ,3 ^ ^3Q^ 

In order to obtain the second fundamental form of the surface, we need the second 
derivatives Xi-ab of the surface and the components rii of the normal vector. Equation (25) 
yields the condition 

for the second derivatives, which will turn out to be enough for our purposes. The normal 
vector is proportional to the gradient, and has to be normalized of course, so we choose 

(wjWj) / 

The minus sign directs the normal vector towards regions of lower density; this corresponds 
to the intuitive expectation that the maxima of the field should be surrounded by islands of 
isocontour. Finally, we obtain the pseudo-tensor 6^6, containing the components of the second 
fundamental form of the surface 

, 1^,i^aij'^,jk^bkl'^,l f'30\ 

Oab = '^iXi-ab = 7 TTT^J • l"^"^J 

{u,mU,mr'^ 

Again, we need the determinant b of this object. Some algebra leads to the result 

b = det b,, = u2^ ^^lkeimnU,U^U,jmU,kn _ (34) 



parameters and therefore run over 1,2, while indices named . . . run from 1 to 3. Summation over indices 
occurring in pairs is understood. 
^ In fact, this solution is not unique, reflecting the freedom of parameterization. However, as long as the 
tangent vectors are not colinear, the final results, namely the local Minkowski Functionals, are independent 
of the parameterization. 
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With these preparations, the mean curvature H and the Gaussian curvature G of the 
isocontour of a scalar field can be expressed in terms of the field's derivatives. In terms of the 
components of the first and second fundamental form, they are given as 

ff=V6^„ G=-. (35) 
9 

Putting the results from above into these formulae yields 

Appendix B: Analysis of Magnitude limited Samples 

As discussed above, in this paper we focus on the analysis of volume-limited samples 
to avoid the systematic effects of variation with redshift of the mean galaxy density and range 
of galaxy luminosity. However, the statistical error of the MFs becomes large in volume- 
limited samples due to the small number density or the small volumes of such samples. In 
apparent-magnitude limited samples we can make use of nearly all of the data. Our results for 
volume-limited samples suggest that the luminosity bias is small and, therefore, that it might 
be reasonable to analyze apparent-magnitude limited samples. To do so requires correction for 
the variation with redshift in the expected galaxy density. 

We compute the selection function (f){z), by integrating over the Schechter form of the 
luminosity function, 

X exp[-10°-^(*'*-^')]rfM , (38) 

with parameters for all galaxies measured in the °'^r band by Blanton et al. (2002) and those 
for Early- and Late-type galaxies measured in the r-band by Nakamura et al (listed in Table 
5). Note that the "'^r band corresponds to the r-band shifted to match their rest-frame shape 
dX z = 0.1. The limits of integration are the absolute-magnitude limits. 



[l + z)d,{z) 



5 log 



K{z). (39) 



lOpc 

We use apparent-magnitude limits mmax = 17.5, mmin = 14.5, and apply an approximate K- 
correction factor K{z) = 0.9z {K{z) = 0.9(^ — 0.1) for ^'^r band). Figure 5 shows the number 
density distributions of our samples of All, Early-type and Late-type galaxies (histograms) 
compared with fits from Equation (38) for both Region 1 and Region 2. We list the properties 
of the magnitude-limited samples with absolute-magnitude range from —23 to —17 in Table 6. 
The redshift range for each sample is determined to be the region where the mean separation 
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Table 5. Parameter values of the Schechter form of the luminosity function (Eq.(38)) for All, Early-type, and Late-type 
galaxies, where the ''■^r-band corresponds to the r-band shifted to match their rest-frame shape at z = 0.1. 



galaxy type 


band Af* — Slogj^g/i 


a 


04lO-2(/i-lMpc)-3) 


All 


-20.44 


-1.05 


1.49 


Early-type 


r -20.62 


-0.68 


0.67 


Late-type 


r -20.35 


-1.12 


1.09 
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Fig. 12. Number density distribution of our SDSS galaxy sample for All (solid histogram), 
Early — type (dotted histogram), and Late-type galaxies (dashed histogram) in Region 1 and 
Region 2, respectively. The estimations with the Schechter form of the luminosity function 
(Eq. (38)) fitted to the observation are also plotted for All (filled circles). Early-type (open tri- 
angles) and Late-type (crosses) galaxies. The redshift width (Az) of the histogram is 1/150. 



of galaxies, 0(2)"^/^, is smaller than the smoothing scale. To construct mock samples with the 
same redshift distribution, we randomly select dark matter particles in the wedge samples in real 
space according to the selection function to reproduce the number of observed SDSS galaxies 
in the current samples. To correct the galaxy density field for the variation with distance of 
the selection function, we weight each galaxy by the inverse of the selection function (f){z)~^ in 
redshift space (e.g., Rhoads et al. 1994; Vogeley et al. 1994). Table 7 lists A^^cs (Eq. (17)) 
and a of galaxies in each sample and a with one-sigma error estimated from the mock samples. 
For all of the smoothing lengths the table shows the general feature that clustering is strong 
for Early-type galaxies, and weak for Late-type galaxies. 

Figure 5 shows the comparison of MFs for All, Early — type and Late-type galaxies 
with error bars estimated from mock samples for each type of galaxies. The averaged MFs 
of mock samples for all galaxies in the LCDM model and the log-normal predictions are also 



28 









1 




Obs.Early- 






Obs.Lale _ 


0.8 


7 \ 


Obs.All - 






Mock.AIl - 


0.6 


\ --- 


Lognormai 


0.4 


- Mag. limit V 






Region 2 \ 




0.3 


- fic=7h-'Mpo \, 






: -23<Mr<-17 







" 


" 




-2 


2 





I 



_1 1 1 1 1 1 1 


Early — type - 
Late-type ! 
All J 


4 7/ - 




; 




" Mag. limit, Re| 


'ion 2 


: /?j=7h-'Mpc, 

" 


-23<Mr<-17 : 

" 



i 4x10-3 
<^ 2x10-3 

L-2X10-3 
ui 

CQ 
O 

Ct. -4x10-3 








-2 








-2 


2 


- V 

1 / 
/ J 




" 






Fig. 13. Upper: Comparison of the MFs between the magnitudc-hmited samples for All (filled cir- 
cles), Early-type (open triangles) and Late-type galaxies (crosses). Error bars, estimated from the mock 
results, are added to the observational results. The averaged MFs from mock samples for all galaxies 
are also plotted with solid lines. The log-normal model predictions (Eq. (13)) are plotted with dot- 
ted lines. Lower: Dependency of the galaxy type on the MFs for magnitude-limited samples of SDSS 
galaxies. Subtractions of the MFs at each of the mock results from the observed MFs are plotted for 
Early-type (dotted lines), Late-type (dashed lines) and All (solid lines) galaxies with error bars estimated 
from the mock results. We adopt the smoothing scale of Rq = 7/i^^Mpc and the data of Region 2. 
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Table 6. Properties of each magnitude— limited sample of SDSS galaxies including the galaxy type, the smoothing scale Rq, 
the redshift range, the number of SDSS galaxies in each sample A^g^i, and the mean number of mass particles in six mock 
samples JVpartido- 



type 




redshift range 


Regionl Region2 


A^particlc 

Regionl Region2 


All 


5h~ 




0.009 <z<0.088 


91 898 




1 8395 


21454 


All 




-lA/lnr 


0.008<z< 0.121 


■^971 4 


30931 


29243 


39950 


All 


lOh 


"^Mpc 


0.007<z< 0.150 


39416 


37512 


34803 


39689 


All 


20h 


-^Mpc 


0.006<z< 0.197 


43355 


41805 


38171 


42050 


Early 


7h- 


"^Mpc 


G.013<z< 0.082 


4657 


4765 


4672 


5309 


Early 


lOh 


"^Mpc 


0.009<z< 0.117 


8069 


7146 


8059 


9232 


Early 


20h 


-^Mpc 


0.007<z< 0.167 


9889 


9168 


10153 


11157 


Late 


5h- 


'^Mpc 


0.011<z< 0.049 


4158 


4390 


4555 


5532 


Late 


7h~ 


"^Mpc 


0.008 <z< 0.078 


7003 


7199 


8262 


9038 


Late 


lOh 


"^Mpc 


0.007<z< 0.104 


8734 


8444 


10415 


11852 


Late 


20h 


"^Mpc 


0.006<z< 0.145 


9761 


9372 


11891 


13077 



plotted. Focusing on the difference in morphological properties of the distribution due to 
the morphological type of galaxies, we plot the subtraction of the simulated results from the 
observed MFs for each type of galaxies, normalized by the MPs predicted from the log-normal 
model, in the lower panel of Figure 5. We conclude that also in the magnitude-limited samples 
the difference due to morphological type of galaxies cannot be appreciated, as was found for 
the volume-limited samples. 
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Table 7. Properties of the smoothed density field of each magnitude-limited sample including Rq, the mesh size of the 
simulation box, the galaxy type, A''i-os (Eq- (17)), the r.m.s. fluctuation cr of the SDSS galaxy number density field, and the 
averaged a with one-sigma error of the particle number density field in the mock samples. 



i?G 

(mesh size) 


type 


N^cs (volume fraction) 
Regionl Region2 


a of SDSS 
Regionl 


galaxies 
Region2 


a of mock samples 
Regionl Region2 




All 


464(0.71) 


486(0.65) 


1.17 


0.97 


1.07±0.13 


0.97±0.10 


(2.1/i-iMpc) 


Late 


112(0.59) 


112(0.52) 


0.86 


0.74 


0.99±0.12 


0.94±0.19 




All 


424(0.70) 


441(0.64) 


0.86 


0.88 


0.83 ±0.09 


0.80 ±0.07 




Early 


113(0.59) 


114(0.52) 


0.98 


0.74 


0.84±0.12 


0.80±0.10 




Late 


121(0.59) 


122(0.52) 


0.66 


0.64 


0.86±0.14 


0.81±0.11 


10/i-iMpc 


All 


182(0.46) 


177(0.39) 


0.63 


0.64 


0.66 ±0.05 


0.66 ±0.04 


(3.8/i-iMpc) 


Early 


65(0.36) 


62(0.30) 


0.73 


0.72 


0.63 ±0.07 


0.59 ±0.05 




Late 


48(0.33) 


45(0.27) 


0.66 


0.55 


0.59 ±0.07 


0.59 ±0.06 


20/i~iMpc 


All 


29(0.27) 


26(0.21) 


0.34 


0.45 


0.33 ±0.05 


0.35 ±0.04 


(5.3/i-iMpc) 


Early 


12(0.20) 


10(0.14) 


0.34 


0.43 


0.32 ±0.07 


0.31 ±0.06 




Late 


7(0.15) 


5(0.11) 


0.28 


0.30 


0.31 ±0.07 


0.32 ±0.07 
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